Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 205 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 41.8 KiB |
| Average record size in memory | 208.6 B |
Variable types
| CAT | 16 |
|---|---|
| NUM | 10 |
Reproduction
| Analysis started | 2020-08-24 08:39:52.254552 |
|---|---|
| Analysis finished | 2020-08-24 08:40:11.455307 |
| Duration | 19.2 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
normalized-losses has a high cardinality: 52 distinct values | High cardinality |
horsepower has a high cardinality: 60 distinct values | High cardinality |
price has a high cardinality: 187 distinct values | High cardinality |
highway-mpg is highly correlated with city-mpg | High correlation |
city-mpg is highly correlated with highway-mpg | High correlation |
fuel-system is highly correlated with fuel-type | High correlation |
fuel-type is highly correlated with fuel-system | High correlation |
bore is highly correlated with engine-location | High correlation |
engine-location is highly correlated with bore and 2 other fields | High correlation |
stroke is highly correlated with engine-location | High correlation |
peak-rpm is highly correlated with engine-location | High correlation |
price is uniformly distributed | Uniform |
symboling has 67 (32.7%) zeros | Zeros |
| Distinct count | 6 |
|---|---|
| Unique (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8341463414634146 |
|---|---|
| Minimum | -2 |
| Maximum | 3 |
| Zeros | 67 |
| Zeros (%) | 32.7% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -1 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 3 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.245306828 |
|---|---|
| Coefficient of variation (CV) | 1.492911695 |
| Kurtosis | -0.6762713562 |
| Mean | 0.8341463415 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.2110722721 |
| Sum | 171 |
| Variance | 1.550789096 |
| Value | Count | Frequency (%) | |
| 0 | 67 | 32.7% | |
| 1 | 54 | 26.3% | |
| 2 | 32 | 15.6% | |
| 3 | 27 | 13.2% | |
| -1 | 22 | 10.7% | |
| -2 | 3 | 1.5% |
| Value | Count | Frequency (%) | |
| -2 | 3 | 1.5% | |
| -1 | 22 | 10.7% | |
| 0 | 67 | 32.7% | |
| 1 | 54 | 26.3% | |
| 2 | 32 | 15.6% |
| Value | Count | Frequency (%) | |
| 3 | 27 | 13.2% | |
| 2 | 32 | 15.6% | |
| 1 | 54 | 26.3% | |
| 0 | 67 | 32.7% | |
| -1 | 22 | 10.7% |
| Distinct count | 52 |
|---|---|
| Unique (%) | 25.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| ? | |
|---|---|
| 161 | 11 |
| 91 | 8 |
| 150 | 7 |
| 134 | 6 |
| Other values (47) |
| Value | Count | Frequency (%) | |
| ? | 41 | 20.0% | |
| 161 | 11 | 5.4% | |
| 91 | 8 | 3.9% | |
| 150 | 7 | 3.4% | |
| 134 | 6 | 2.9% | |
| 104 | 6 | 2.9% | |
| 128 | 6 | 2.9% | |
| 94 | 5 | 2.4% | |
| 102 | 5 | 2.4% | |
| 85 | 5 | 2.4% | |
| Other values (42) | 105 | 51.2% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.356097561 |
| Min length | 1 |
make
Categorical
| Distinct count | 22 |
|---|---|
| Unique (%) | 10.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| toyota | |
|---|---|
| nissan | 18 |
| mazda | 17 |
| honda | 13 |
| mitsubishi | 13 |
| Other values (17) |
| Value | Count | Frequency (%) | |
| toyota | 32 | 15.6% | |
| nissan | 18 | 8.8% | |
| mazda | 17 | 8.3% | |
| honda | 13 | 6.3% | |
| mitsubishi | 13 | 6.3% | |
| subaru | 12 | 5.9% | |
| volkswagen | 12 | 5.9% | |
| peugot | 11 | 5.4% | |
| volvo | 11 | 5.4% | |
| dodge | 9 | 4.4% | |
| Other values (12) | 57 | 27.8% |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.47804878 |
| Min length | 3 |
| Distinct count | 2 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| gas | |
|---|---|
| diesel | 20 |
| Value | Count | Frequency (%) | |
| gas | 185 | 90.2% | |
| diesel | 20 | 9.8% |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.292682927 |
| Min length | 3 |
aspiration
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| std | |
|---|---|
| turbo |
| Value | Count | Frequency (%) | |
| std | 168 | 82.0% | |
| turbo | 37 | 18.0% |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.36097561 |
| Min length | 3 |
num-of-doors
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| four | |
|---|---|
| two | |
| ? | 2 |
| Value | Count | Frequency (%) | |
| four | 114 | 55.6% | |
| two | 89 | 43.4% | |
| ? | 2 | 1.0% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.536585366 |
| Min length | 1 |
body-style
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| sedan | |
|---|---|
| hatchback | |
| wagon | |
| hardtop | 8 |
| convertible | 6 |
| Value | Count | Frequency (%) | |
| sedan | 96 | 46.8% | |
| hatchback | 70 | 34.1% | |
| wagon | 25 | 12.2% | |
| hardtop | 8 | 3.9% | |
| convertible | 6 | 2.9% |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 6.619512195 |
| Min length | 5 |
drive-wheels
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| fwd | |
|---|---|
| rwd | |
| 4wd | 9 |
| Value | Count | Frequency (%) | |
| fwd | 120 | 58.5% | |
| rwd | 76 | 37.1% | |
| 4wd | 9 | 4.4% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
| Distinct count | 2 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| front | |
|---|---|
| rear | 3 |
| Value | Count | Frequency (%) | |
| front | 202 | 98.5% | |
| rear | 3 | 1.5% |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.985365854 |
| Min length | 4 |
wheel-base
Real number (ℝ≥0)
| Distinct count | 53 |
|---|---|
| Unique (%) | 25.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98.75658536585367 |
|---|---|
| Minimum | 86.6 |
| Maximum | 120.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 86.6 |
|---|---|
| 5-th percentile | 93.02 |
| Q1 | 94.5 |
| median | 97 |
| Q3 | 102.4 |
| 95-th percentile | 110 |
| Maximum | 120.9 |
| Range | 34.3 |
| Interquartile range (IQR) | 7.9 |
Descriptive statistics
| Standard deviation | 6.021775685 |
|---|---|
| Coefficient of variation (CV) | 0.06097594062 |
| Kurtosis | 1.017038946 |
| Mean | 98.75658537 |
| Median Absolute Deviation (MAD) | 2.7 |
| Skewness | 1.050213776 |
| Sum | 20245.1 |
| Variance | 36.2617824 |
| Value | Count | Frequency (%) | |
| 94.5 | 21 | 10.2% | |
| 93.7 | 20 | 9.8% | |
| 95.7 | 13 | 6.3% | |
| 96.5 | 8 | 3.9% | |
| 98.4 | 7 | 3.4% | |
| 97.3 | 7 | 3.4% | |
| 96.3 | 6 | 2.9% | |
| 107.9 | 6 | 2.9% | |
| 98.8 | 6 | 2.9% | |
| 99.1 | 6 | 2.9% | |
| Other values (43) | 105 | 51.2% |
| Value | Count | Frequency (%) | |
| 86.6 | 2 | 1.0% | |
| 88.4 | 1 | 0.5% | |
| 88.6 | 2 | 1.0% | |
| 89.5 | 3 | 1.5% | |
| 91.3 | 2 | 1.0% |
| Value | Count | Frequency (%) | |
| 120.9 | 1 | 0.5% | |
| 115.6 | 2 | 1.0% | |
| 114.2 | 4 | 2.0% | |
| 113 | 2 | 1.0% | |
| 112 | 1 | 0.5% |
length
Real number (ℝ≥0)
| Distinct count | 75 |
|---|---|
| Unique (%) | 36.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 174.04926829268288 |
|---|---|
| Minimum | 141.1 |
| Maximum | 208.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 141.1 |
|---|---|
| 5-th percentile | 157.14 |
| Q1 | 166.3 |
| median | 173.2 |
| Q3 | 183.1 |
| 95-th percentile | 196.36 |
| Maximum | 208.1 |
| Range | 67 |
| Interquartile range (IQR) | 16.8 |
Descriptive statistics
| Standard deviation | 12.33728853 |
|---|---|
| Coefficient of variation (CV) | 0.0708838862 |
| Kurtosis | -0.08289485345 |
| Mean | 174.0492683 |
| Median Absolute Deviation (MAD) | 6.9 |
| Skewness | 0.1559537713 |
| Sum | 35680.1 |
| Variance | 152.2086882 |
| Value | Count | Frequency (%) | |
| 157.3 | 15 | 7.3% | |
| 188.8 | 11 | 5.4% | |
| 166.3 | 7 | 3.4% | |
| 171.7 | 7 | 3.4% | |
| 186.7 | 7 | 3.4% | |
| 165.3 | 6 | 2.9% | |
| 177.8 | 6 | 2.9% | |
| 176.2 | 6 | 2.9% | |
| 186.6 | 6 | 2.9% | |
| 176.8 | 5 | 2.4% | |
| Other values (65) | 129 | 62.9% |
| Value | Count | Frequency (%) | |
| 141.1 | 1 | 0.5% | |
| 144.6 | 2 | 1.0% | |
| 150 | 3 | 1.5% | |
| 155.9 | 3 | 1.5% | |
| 156.9 | 1 | 0.5% |
| Value | Count | Frequency (%) | |
| 208.1 | 1 | 0.5% | |
| 202.6 | 2 | 1.0% | |
| 199.6 | 2 | 1.0% | |
| 199.2 | 1 | 0.5% | |
| 198.9 | 4 | 2.0% |
width
Real number (ℝ≥0)
| Distinct count | 44 |
|---|---|
| Unique (%) | 21.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65.90780487804878 |
|---|---|
| Minimum | 60.3 |
| Maximum | 72.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 60.3 |
|---|---|
| 5-th percentile | 63.6 |
| Q1 | 64.1 |
| median | 65.5 |
| Q3 | 66.9 |
| 95-th percentile | 70.46 |
| Maximum | 72.3 |
| Range | 12 |
| Interquartile range (IQR) | 2.8 |
Descriptive statistics
| Standard deviation | 2.145203853 |
|---|---|
| Coefficient of variation (CV) | 0.03254855562 |
| Kurtosis | 0.7027642441 |
| Mean | 65.90780488 |
| Median Absolute Deviation (MAD) | 1.4 |
| Skewness | 0.9040034988 |
| Sum | 13511.1 |
| Variance | 4.60189957 |
| Value | Count | Frequency (%) | |
| 63.8 | 24 | 11.7% | |
| 66.5 | 23 | 11.2% | |
| 65.4 | 15 | 7.3% | |
| 63.6 | 11 | 5.4% | |
| 64.4 | 10 | 4.9% | |
| 68.4 | 10 | 4.9% | |
| 64 | 9 | 4.4% | |
| 65.5 | 8 | 3.9% | |
| 65.2 | 7 | 3.4% | |
| 66.3 | 6 | 2.9% | |
| Other values (34) | 82 | 40.0% |
| Value | Count | Frequency (%) | |
| 60.3 | 1 | 0.5% | |
| 61.8 | 1 | 0.5% | |
| 62.5 | 1 | 0.5% | |
| 63.4 | 1 | 0.5% | |
| 63.6 | 11 | 5.4% |
| Value | Count | Frequency (%) | |
| 72.3 | 1 | 0.5% | |
| 72 | 1 | 0.5% | |
| 71.7 | 3 | 1.5% | |
| 71.4 | 3 | 1.5% | |
| 70.9 | 1 | 0.5% |
height
Real number (ℝ≥0)
| Distinct count | 49 |
|---|---|
| Unique (%) | 23.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53.72487804878049 |
|---|---|
| Minimum | 47.8 |
| Maximum | 59.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 47.8 |
|---|---|
| 5-th percentile | 49.7 |
| Q1 | 52 |
| median | 54.1 |
| Q3 | 55.5 |
| 95-th percentile | 57.5 |
| Maximum | 59.8 |
| Range | 12 |
| Interquartile range (IQR) | 3.5 |
Descriptive statistics
| Standard deviation | 2.44352197 |
|---|---|
| Coefficient of variation (CV) | 0.04548213153 |
| Kurtosis | -0.4438123651 |
| Mean | 53.72487805 |
| Median Absolute Deviation (MAD) | 1.6 |
| Skewness | 0.06312273247 |
| Sum | 11013.6 |
| Variance | 5.970799617 |
| Value | Count | Frequency (%) | |
| 50.8 | 14 | 6.8% | |
| 52 | 12 | 5.9% | |
| 55.7 | 12 | 5.9% | |
| 54.5 | 10 | 4.9% | |
| 54.1 | 10 | 4.9% | |
| 55.5 | 9 | 4.4% | |
| 56.7 | 8 | 3.9% | |
| 54.3 | 8 | 3.9% | |
| 51.6 | 7 | 3.4% | |
| 56.1 | 7 | 3.4% | |
| Other values (39) | 108 | 52.7% |
| Value | Count | Frequency (%) | |
| 47.8 | 1 | 0.5% | |
| 48.8 | 2 | 1.0% | |
| 49.4 | 2 | 1.0% | |
| 49.6 | 4 | 2.0% | |
| 49.7 | 3 | 1.5% |
| Value | Count | Frequency (%) | |
| 59.8 | 2 | 1.0% | |
| 59.1 | 3 | 1.5% | |
| 58.7 | 4 | 2.0% | |
| 58.3 | 1 | 0.5% | |
| 57.5 | 3 | 1.5% |
curb-weight
Real number (ℝ≥0)
| Distinct count | 171 |
|---|---|
| Unique (%) | 83.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2555.5658536585365 |
|---|---|
| Minimum | 1488 |
| Maximum | 4066 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 1488 |
|---|---|
| 5-th percentile | 1901 |
| Q1 | 2145 |
| median | 2414 |
| Q3 | 2935 |
| 95-th percentile | 3503 |
| Maximum | 4066 |
| Range | 2578 |
| Interquartile range (IQR) | 790 |
Descriptive statistics
| Standard deviation | 520.6802035 |
|---|---|
| Coefficient of variation (CV) | 0.2037436064 |
| Kurtosis | -0.0428537661 |
| Mean | 2555.565854 |
| Median Absolute Deviation (MAD) | 386 |
| Skewness | 0.6813981891 |
| Sum | 523891 |
| Variance | 271107.8743 |
| Value | Count | Frequency (%) | |
| 2385 | 4 | 2.0% | |
| 1989 | 3 | 1.5% | |
| 1918 | 3 | 1.5% | |
| 2275 | 3 | 1.5% | |
| 3230 | 2 | 1.0% | |
| 2410 | 2 | 1.0% | |
| 3252 | 2 | 1.0% | |
| 2337 | 2 | 1.0% | |
| 2403 | 2 | 1.0% | |
| 2414 | 2 | 1.0% | |
| Other values (161) | 180 | 87.8% |
| Value | Count | Frequency (%) | |
| 1488 | 1 | 0.5% | |
| 1713 | 1 | 0.5% | |
| 1819 | 1 | 0.5% | |
| 1837 | 1 | 0.5% | |
| 1874 | 2 | 1.0% |
| Value | Count | Frequency (%) | |
| 4066 | 2 | 1.0% | |
| 3950 | 1 | 0.5% | |
| 3900 | 1 | 0.5% | |
| 3770 | 1 | 0.5% | |
| 3750 | 1 | 0.5% |
engine-type
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| ohc | |
|---|---|
| ohcf | 15 |
| ohcv | 13 |
| dohc | 12 |
| l | 12 |
| Other values (2) | 5 |
| Value | Count | Frequency (%) | |
| ohc | 148 | 72.2% | |
| ohcf | 15 | 7.3% | |
| ohcv | 13 | 6.3% | |
| dohc | 12 | 5.9% | |
| l | 12 | 5.9% | |
| rotor | 4 | 2.0% | |
| dohcv | 1 | 0.5% |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.126829268 |
| Min length | 1 |
num-of-cylinders
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| four | |
|---|---|
| six | 24 |
| five | 11 |
| eight | 5 |
| two | 4 |
| Other values (2) | 2 |
| Value | Count | Frequency (%) | |
| four | 159 | 77.6% | |
| six | 24 | 11.7% | |
| five | 11 | 5.4% | |
| eight | 5 | 2.4% | |
| two | 4 | 2.0% | |
| twelve | 1 | 0.5% | |
| three | 1 | 0.5% |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 3.902439024 |
| Min length | 3 |
engine-size
Real number (ℝ≥0)
| Distinct count | 44 |
|---|---|
| Unique (%) | 21.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.90731707317073 |
|---|---|
| Minimum | 61 |
| Maximum | 326 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 61 |
|---|---|
| 5-th percentile | 90 |
| Q1 | 97 |
| median | 120 |
| Q3 | 141 |
| 95-th percentile | 201.2 |
| Maximum | 326 |
| Range | 265 |
| Interquartile range (IQR) | 44 |
Descriptive statistics
| Standard deviation | 41.64269344 |
|---|---|
| Coefficient of variation (CV) | 0.3281346923 |
| Kurtosis | 5.305682092 |
| Mean | 126.9073171 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 1.947655045 |
| Sum | 26016 |
| Variance | 1734.113917 |
| Value | Count | Frequency (%) | |
| 122 | 15 | 7.3% | |
| 92 | 15 | 7.3% | |
| 98 | 14 | 6.8% | |
| 97 | 14 | 6.8% | |
| 108 | 13 | 6.3% | |
| 90 | 12 | 5.9% | |
| 110 | 12 | 5.9% | |
| 109 | 8 | 3.9% | |
| 120 | 7 | 3.4% | |
| 141 | 7 | 3.4% | |
| Other values (34) | 88 | 42.9% |
| Value | Count | Frequency (%) | |
| 61 | 1 | 0.5% | |
| 70 | 3 | 1.5% | |
| 79 | 1 | 0.5% | |
| 80 | 1 | 0.5% | |
| 90 | 12 | 5.9% |
| Value | Count | Frequency (%) | |
| 326 | 1 | 0.5% | |
| 308 | 1 | 0.5% | |
| 304 | 1 | 0.5% | |
| 258 | 2 | 1.0% | |
| 234 | 2 | 1.0% |
| Distinct count | 8 |
|---|---|
| Unique (%) | 3.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| mpfi | |
|---|---|
| 2bbl | |
| idi | |
| 1bbl | 11 |
| spdi | 9 |
| Other values (3) | 5 |
| Value | Count | Frequency (%) | |
| mpfi | 94 | 45.9% | |
| 2bbl | 66 | 32.2% | |
| idi | 20 | 9.8% | |
| 1bbl | 11 | 5.4% | |
| spdi | 9 | 4.4% | |
| 4bbl | 3 | 1.5% | |
| spfi | 1 | 0.5% | |
| mfi | 1 | 0.5% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.897560976 |
| Min length | 3 |
| Distinct count | 39 |
|---|---|
| Unique (%) | 19.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| 3.62 | 23 |
|---|---|
| 3.19 | 20 |
| 3.15 | 15 |
| 3.03 | 12 |
| 2.97 | 12 |
| Other values (34) |
| Value | Count | Frequency (%) | |
| 3.62 | 23 | 11.2% | |
| 3.19 | 20 | 9.8% | |
| 3.15 | 15 | 7.3% | |
| 3.03 | 12 | 5.9% | |
| 2.97 | 12 | 5.9% | |
| 3.46 | 9 | 4.4% | |
| 3.78 | 8 | 3.9% | |
| 3.31 | 8 | 3.9% | |
| 3.43 | 8 | 3.9% | |
| 2.91 | 7 | 3.4% | |
| Other values (29) | 83 | 40.5% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.892682927 |
| Min length | 1 |
| Distinct count | 37 |
|---|---|
| Unique (%) | 18.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| 3.4 | 20 |
|---|---|
| 3.23 | 14 |
| 3.03 | 14 |
| 3.15 | 14 |
| 3.39 | 13 |
| Other values (32) |
| Value | Count | Frequency (%) | |
| 3.4 | 20 | 9.8% | |
| 3.23 | 14 | 6.8% | |
| 3.03 | 14 | 6.8% | |
| 3.15 | 14 | 6.8% | |
| 3.39 | 13 | 6.3% | |
| 2.64 | 11 | 5.4% | |
| 3.35 | 9 | 4.4% | |
| 3.29 | 9 | 4.4% | |
| 3.46 | 8 | 3.9% | |
| 3.19 | 6 | 2.9% | |
| Other values (27) | 87 | 42.4% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.765853659 |
| Min length | 1 |
compression-ratio
Real number (ℝ≥0)
| Distinct count | 32 |
|---|---|
| Unique (%) | 15.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.142536585365855 |
|---|---|
| Minimum | 7.0 |
| Maximum | 23.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 7.5 |
| Q1 | 8.6 |
| median | 9 |
| Q3 | 9.4 |
| 95-th percentile | 21.82 |
| Maximum | 23 |
| Range | 16 |
| Interquartile range (IQR) | 0.8 |
Descriptive statistics
| Standard deviation | 3.972040322 |
|---|---|
| Coefficient of variation (CV) | 0.3916219861 |
| Kurtosis | 5.233054348 |
| Mean | 10.14253659 |
| Median Absolute Deviation (MAD) | 0.4 |
| Skewness | 2.610862458 |
| Sum | 2079.22 |
| Variance | 15.77710432 |
| Value | Count | Frequency (%) | |
| 9 | 46 | 22.4% | |
| 9.4 | 26 | 12.7% | |
| 8.5 | 14 | 6.8% | |
| 9.5 | 13 | 6.3% | |
| 9.3 | 11 | 5.4% | |
| 8.7 | 9 | 4.4% | |
| 9.2 | 8 | 3.9% | |
| 8 | 8 | 3.9% | |
| 7 | 7 | 3.4% | |
| 21 | 5 | 2.4% | |
| Other values (22) | 58 | 28.3% |
| Value | Count | Frequency (%) | |
| 7 | 7 | 3.4% | |
| 7.5 | 5 | 2.4% | |
| 7.6 | 4 | 2.0% | |
| 7.7 | 2 | 1.0% | |
| 7.8 | 1 | 0.5% |
| Value | Count | Frequency (%) | |
| 23 | 5 | 2.4% | |
| 22.7 | 1 | 0.5% | |
| 22.5 | 3 | 1.5% | |
| 22 | 1 | 0.5% | |
| 21.9 | 1 | 0.5% |
| Distinct count | 60 |
|---|---|
| Unique (%) | 29.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| 68 | 19 |
|---|---|
| 70 | 11 |
| 69 | 10 |
| 116 | 9 |
| 110 | 8 |
| Other values (55) |
| Value | Count | Frequency (%) | |
| 68 | 19 | 9.3% | |
| 70 | 11 | 5.4% | |
| 69 | 10 | 4.9% | |
| 116 | 9 | 4.4% | |
| 110 | 8 | 3.9% | |
| 95 | 7 | 3.4% | |
| 62 | 6 | 2.9% | |
| 114 | 6 | 2.9% | |
| 160 | 6 | 2.9% | |
| 101 | 6 | 2.9% | |
| Other values (50) | 117 | 57.1% |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.448780488 |
| Min length | 1 |
| Distinct count | 24 |
|---|---|
| Unique (%) | 11.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| 5500 | |
|---|---|
| 4800 | |
| 5000 | |
| 5200 | |
| 5400 | 13 |
| Other values (19) |
| Value | Count | Frequency (%) | |
| 5500 | 37 | 18.0% | |
| 4800 | 36 | 17.6% | |
| 5000 | 27 | 13.2% | |
| 5200 | 23 | 11.2% | |
| 5400 | 13 | 6.3% | |
| 6000 | 9 | 4.4% | |
| 5250 | 7 | 3.4% | |
| 4500 | 7 | 3.4% | |
| 5800 | 7 | 3.4% | |
| 4200 | 5 | 2.4% | |
| Other values (14) | 34 | 16.6% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.970731707 |
| Min length | 1 |
| Distinct count | 29 |
|---|---|
| Unique (%) | 14.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.21951219512195 |
|---|---|
| Minimum | 13 |
| Maximum | 49 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 19 |
| median | 24 |
| Q3 | 30 |
| 95-th percentile | 37 |
| Maximum | 49 |
| Range | 36 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 6.542141653 |
|---|---|
| Coefficient of variation (CV) | 0.2594079379 |
| Kurtosis | 0.5786483405 |
| Mean | 25.2195122 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.6637040288 |
| Sum | 5170 |
| Variance | 42.79961741 |
| Value | Count | Frequency (%) | |
| 31 | 28 | 13.7% | |
| 19 | 27 | 13.2% | |
| 24 | 22 | 10.7% | |
| 27 | 14 | 6.8% | |
| 17 | 13 | 6.3% | |
| 26 | 12 | 5.9% | |
| 23 | 12 | 5.9% | |
| 21 | 8 | 3.9% | |
| 30 | 8 | 3.9% | |
| 25 | 8 | 3.9% | |
| Other values (19) | 53 | 25.9% |
| Value | Count | Frequency (%) | |
| 13 | 1 | 0.5% | |
| 14 | 2 | 1.0% | |
| 15 | 3 | 1.5% | |
| 16 | 6 | 2.9% | |
| 17 | 13 | 6.3% |
| Value | Count | Frequency (%) | |
| 49 | 1 | 0.5% | |
| 47 | 1 | 0.5% | |
| 45 | 1 | 0.5% | |
| 38 | 7 | 3.4% | |
| 37 | 6 | 2.9% |
| Distinct count | 30 |
|---|---|
| Unique (%) | 14.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.75121951219512 |
|---|---|
| Minimum | 16 |
| Maximum | 54 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 25 |
| median | 30 |
| Q3 | 34 |
| 95-th percentile | 42.8 |
| Maximum | 54 |
| Range | 38 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 6.886443131 |
|---|---|
| Coefficient of variation (CV) | 0.2239404889 |
| Kurtosis | 0.4400703815 |
| Mean | 30.75121951 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.5399971879 |
| Sum | 6304 |
| Variance | 47.423099 |
| Value | Count | Frequency (%) | |
| 25 | 19 | 9.3% | |
| 24 | 17 | 8.3% | |
| 38 | 17 | 8.3% | |
| 30 | 16 | 7.8% | |
| 32 | 16 | 7.8% | |
| 34 | 14 | 6.8% | |
| 37 | 13 | 6.3% | |
| 28 | 13 | 6.3% | |
| 29 | 10 | 4.9% | |
| 33 | 9 | 4.4% | |
| Other values (20) | 61 | 29.8% |
| Value | Count | Frequency (%) | |
| 16 | 2 | 1.0% | |
| 17 | 1 | 0.5% | |
| 18 | 2 | 1.0% | |
| 19 | 2 | 1.0% | |
| 20 | 2 | 1.0% |
| Value | Count | Frequency (%) | |
| 54 | 1 | 0.5% | |
| 53 | 1 | 0.5% | |
| 50 | 1 | 0.5% | |
| 47 | 2 | 1.0% | |
| 46 | 2 | 1.0% |
| Distinct count | 187 |
|---|---|
| Unique (%) | 91.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| ? | 4 |
|---|---|
| 7775 | 2 |
| 7609 | 2 |
| 9279 | 2 |
| 18150 | 2 |
| Other values (182) |
| Value | Count | Frequency (%) | |
| ? | 4 | 2.0% | |
| 7775 | 2 | 1.0% | |
| 7609 | 2 | 1.0% | |
| 9279 | 2 | 1.0% | |
| 18150 | 2 | 1.0% | |
| 13499 | 2 | 1.0% | |
| 7295 | 2 | 1.0% | |
| 8495 | 2 | 1.0% | |
| 7898 | 2 | 1.0% | |
| 5572 | 2 | 1.0% | |
| Other values (177) | 183 | 89.3% |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.443902439 |
| Min length | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| symboling | normalized-losses | make | fuel-type | aspiration | num-of-doors | body-style | drive-wheels | engine-location | wheel-base | length | width | height | curb-weight | engine-type | num-of-cylinders | engine-size | fuel-system | bore | stroke | compression-ratio | horsepower | peak-rpm | city-mpg | highway-mpg | price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3 | ? | alfa-romero | gas | std | two | convertible | rwd | front | 88.6 | 168.8 | 64.1 | 48.8 | 2548 | dohc | four | 130 | mpfi | 3.47 | 2.68 | 9.0 | 111 | 5000 | 21 | 27 | 13495 |
| 1 | 3 | ? | alfa-romero | gas | std | two | convertible | rwd | front | 88.6 | 168.8 | 64.1 | 48.8 | 2548 | dohc | four | 130 | mpfi | 3.47 | 2.68 | 9.0 | 111 | 5000 | 21 | 27 | 16500 |
| 2 | 1 | ? | alfa-romero | gas | std | two | hatchback | rwd | front | 94.5 | 171.2 | 65.5 | 52.4 | 2823 | ohcv | six | 152 | mpfi | 2.68 | 3.47 | 9.0 | 154 | 5000 | 19 | 26 | 16500 |
| 3 | 2 | 164 | audi | gas | std | four | sedan | fwd | front | 99.8 | 176.6 | 66.2 | 54.3 | 2337 | ohc | four | 109 | mpfi | 3.19 | 3.4 | 10.0 | 102 | 5500 | 24 | 30 | 13950 |
| 4 | 2 | 164 | audi | gas | std | four | sedan | 4wd | front | 99.4 | 176.6 | 66.4 | 54.3 | 2824 | ohc | five | 136 | mpfi | 3.19 | 3.4 | 8.0 | 115 | 5500 | 18 | 22 | 17450 |
| 5 | 2 | ? | audi | gas | std | two | sedan | fwd | front | 99.8 | 177.3 | 66.3 | 53.1 | 2507 | ohc | five | 136 | mpfi | 3.19 | 3.4 | 8.5 | 110 | 5500 | 19 | 25 | 15250 |
| 6 | 1 | 158 | audi | gas | std | four | sedan | fwd | front | 105.8 | 192.7 | 71.4 | 55.7 | 2844 | ohc | five | 136 | mpfi | 3.19 | 3.4 | 8.5 | 110 | 5500 | 19 | 25 | 17710 |
| 7 | 1 | ? | audi | gas | std | four | wagon | fwd | front | 105.8 | 192.7 | 71.4 | 55.7 | 2954 | ohc | five | 136 | mpfi | 3.19 | 3.4 | 8.5 | 110 | 5500 | 19 | 25 | 18920 |
| 8 | 1 | 158 | audi | gas | turbo | four | sedan | fwd | front | 105.8 | 192.7 | 71.4 | 55.9 | 3086 | ohc | five | 131 | mpfi | 3.13 | 3.4 | 8.3 | 140 | 5500 | 17 | 20 | 23875 |
| 9 | 0 | ? | audi | gas | turbo | two | hatchback | 4wd | front | 99.5 | 178.2 | 67.9 | 52.0 | 3053 | ohc | five | 131 | mpfi | 3.13 | 3.4 | 7.0 | 160 | 5500 | 16 | 22 | ? |
Last rows
| symboling | normalized-losses | make | fuel-type | aspiration | num-of-doors | body-style | drive-wheels | engine-location | wheel-base | length | width | height | curb-weight | engine-type | num-of-cylinders | engine-size | fuel-system | bore | stroke | compression-ratio | horsepower | peak-rpm | city-mpg | highway-mpg | price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 195 | -1 | 74 | volvo | gas | std | four | wagon | rwd | front | 104.3 | 188.8 | 67.2 | 57.5 | 3034 | ohc | four | 141 | mpfi | 3.78 | 3.15 | 9.5 | 114 | 5400 | 23 | 28 | 13415 |
| 196 | -2 | 103 | volvo | gas | std | four | sedan | rwd | front | 104.3 | 188.8 | 67.2 | 56.2 | 2935 | ohc | four | 141 | mpfi | 3.78 | 3.15 | 9.5 | 114 | 5400 | 24 | 28 | 15985 |
| 197 | -1 | 74 | volvo | gas | std | four | wagon | rwd | front | 104.3 | 188.8 | 67.2 | 57.5 | 3042 | ohc | four | 141 | mpfi | 3.78 | 3.15 | 9.5 | 114 | 5400 | 24 | 28 | 16515 |
| 198 | -2 | 103 | volvo | gas | turbo | four | sedan | rwd | front | 104.3 | 188.8 | 67.2 | 56.2 | 3045 | ohc | four | 130 | mpfi | 3.62 | 3.15 | 7.5 | 162 | 5100 | 17 | 22 | 18420 |
| 199 | -1 | 74 | volvo | gas | turbo | four | wagon | rwd | front | 104.3 | 188.8 | 67.2 | 57.5 | 3157 | ohc | four | 130 | mpfi | 3.62 | 3.15 | 7.5 | 162 | 5100 | 17 | 22 | 18950 |
| 200 | -1 | 95 | volvo | gas | std | four | sedan | rwd | front | 109.1 | 188.8 | 68.9 | 55.5 | 2952 | ohc | four | 141 | mpfi | 3.78 | 3.15 | 9.5 | 114 | 5400 | 23 | 28 | 16845 |
| 201 | -1 | 95 | volvo | gas | turbo | four | sedan | rwd | front | 109.1 | 188.8 | 68.8 | 55.5 | 3049 | ohc | four | 141 | mpfi | 3.78 | 3.15 | 8.7 | 160 | 5300 | 19 | 25 | 19045 |
| 202 | -1 | 95 | volvo | gas | std | four | sedan | rwd | front | 109.1 | 188.8 | 68.9 | 55.5 | 3012 | ohcv | six | 173 | mpfi | 3.58 | 2.87 | 8.8 | 134 | 5500 | 18 | 23 | 21485 |
| 203 | -1 | 95 | volvo | diesel | turbo | four | sedan | rwd | front | 109.1 | 188.8 | 68.9 | 55.5 | 3217 | ohc | six | 145 | idi | 3.01 | 3.4 | 23.0 | 106 | 4800 | 26 | 27 | 22470 |
| 204 | -1 | 95 | volvo | gas | turbo | four | sedan | rwd | front | 109.1 | 188.8 | 68.9 | 55.5 | 3062 | ohc | four | 141 | mpfi | 3.78 | 3.15 | 9.5 | 114 | 5400 | 19 | 25 | 22625 |